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FACETS OF THE BALANCED MINIMAL EVOLUTION POLYTOPE. 


STEFAN FORCEY, LOGAN KEEFE, AND WILLIAM SANDS 


Abstract. The balanced minimal evolution (BME) method of creating phylogenetic 
trees can be formulated as a linear programming problem, minimizing an inner product 
over the vertices of the BME polytope. In this paper we undertake the project of 
describing the facets of this polytope. We classify and identify the combinatorial 
structure and geometry (facet inequalities) of all the facets in dimensions up to 5, and 
classify even more facets in all dimensions. A full set of facet inequalities would allow 
a full implementation of the simplex method for finding the BME tree-although there 
are reasons to think this an unreachable goal. However, our results provide the crucial 
first steps for a more likely-to-be-successful program: finding efficient relaxations of 
the BME polytope. 


1. Introduction 

The goal of phylogenetics is to take a set of related items- biological examples are 
usually referred to as taxa: populations, species, individuals or genes-and to construct a 
branching diagram that explains how they are related chronologically. The diagram we 
will be concerned with is a binary tree with labeled leaves. In other words, a cycle-free 
graph with nodes (vertices) which are either of degree one (touching a single edge) or 
degree three, and with a set of distinct items assigned to the degree one nodes-the leaves. 
We study a method called balanced minimal evolution. This method begins with a given 
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Figure 1. There are two combinatorial types of facets of the 5- 
dimensional BME polytope, shown here as Schlegel diagrams. The poly¬ 
tope on the left is the 4-simplex, and on the right is the 4d Birkhoff 
polytope. The Schlegel diagram for the latter is expanded in Figure 11. 
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set of n items and a symmetric (or upper triangular) square n x n dissimilarity matrix 
whose entries are numerical dissimilarities, or distances, between pairs of items. From 
the dissimilarity matrix the balanced minimal evolution (BME) method constructs a 
binary tree with the n items labeling the n leaves. The BME tree has the property 
that the distances between its leaves most closely match the given distances between 
corresponding pairs of taxa. 

By “most closely match” in the previous paragraph we mean the following: the 
reciprocals of the distances between leaves are the components of a vector c, and this 
vector minimizes the dot product c ■ d where d is the list of distances in the upper 
triangle of the distance matrix. 

More precisely: Let the set of n distinct species, or taxa, be called S. For convenience 
we will often let S = [n] = {1, 2, ..., n}. Let vector d be given, having (”) real valued 
components dij, one for each pair {i,j} C S. There is a vector c{t) for each binary 
tree t on leaves S, also having ( 2 ) components Cij(t), one for each pair {i,j} C S. 
These components are ordered in the same way for both vectors, and we will use the 
lexicographic ordering: d = {di 2 , dis, • • •, din, ^ 23 , c? 24 , ■ ■ ■, dn-i,n)- 

We dehne, following Pauplin [PauOO]: 


Cij (t) 
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where I is the number of internal nodes (degree 3 vertices) in the path from leaf i to 
leaf j. 

The BME tree for the vector d is the binary tree t that minimizes d-c(f) for all binary 
trees on leaves S. The value of setting up the question in this way is that it becomes a 
linear programming problem. The convex hull of all the vectors c{t) for all binary trees 
f on S' is a polytope BME(S'), hereafter also denoted BME(n) or Vn as in [EHPY08] 
and [HHYll]. The vertices of Vn are precisely the {2n — 5)!! vectors c{t). Minimizing 
our dot product over this polytope is equivalent to minimizing over the vertices, and 
thus amenable to the simplex method. 

In Figure 2 we see the 2-dimensional polytope P 4 . In that hgure we illustrate a 
simplifying choice that will be used throughout: rather than the original fractional 
coordinates we will scale by a factor of 2 "“^, giving coordinates Xij = = 

2 n- 2 -i^ Since the furthest apart any two leaves may be is a distance of n — 2 internal 
nodes, this scaling will result in integral coordinates. 

A clade is a subgraph of a binary tree induced by an internal (degree three) node 
and all of the leaves descended from it in a particular direction. In other words: given 
an internal node v we choose two of its edges and all of the leaves that are connected to 
V via those two edges. Equivalently, given any internal edge, its deletion separates the 
tree into two clades. Two clades on the same tree must be either disjoint or nested, one 
contained in the other. A cherry is a clade with two leaves. We often refer to a clade 
by its set of (2 or more) leaves. A pair of intersecting cherries {a, h} and {h, c} have 
intersection in one leaf b, and thus cannot exist both on the same tree. A caterpillar is 
a tree with only two cherries. 
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2. New Results 

A facet of a polytope is a top-dimensional face of that polytope’s boundary, or a 
CO- dimension-1 face. Faces of a polytope can be of any dimension, from 0 to that of 
the (improper) face which is the polytope itself. Our main results are to describe many 
new faces, especially facets, of the balanced minimal evolution polytope 

For n = 5 we completely classify the facets according to combinatorial type of their 
vertices. There are three classes of facet for n = 5, which we refer to as intersecting- 
cherry facets, caterpillar facets, and cyclic-ordering facets. There are respectively 30, 
10 and 12 of these types of facet in V 5 . 

In Theorem 4.3 we show that any pair of intersecting cherries corresponds to a facet 
of Vn- In Theorem 4.1 we (redundantly, for demonstrative purpose) show a special case 
of this facet for n = 5 and in Theorem 4.2 we show that for u = 5 these facets turn out 
to be equivalent to Birkhoff polytopes. 

In Theorem 6.3 we show that any caterpillar tree with hxed ends corresponds to a 
facet of Vn- For n = 5 we show in Theorem 6.2 that this facet is a Birkhoff polytope. 

In Theorem 5.1 we show that, for n = 5, for each free cyclic ordering of leaves there 
is a corresponding facet which is combinatorially equivalent to a simplex. 

The right half of Table 1 summarizes these new results. 

First though, in the next section, we go over some previously discovered facts about 
the edges and faces of the BME polytopes. Our contribution there is Theorem 3.1, in 




U U U i\ (UU uui, i\ 
42442 4^ V 4 4224 4^ 


( 2 , 1 , 1 , 1 , 1 , 2 ) 



( 1 , 2 , 1 , 1 , 2 , 1 ) ( 1 , 1 , 2 , 2 , 1 , 1 ) 


Figure 2. The polytope V 4 is a triangle. At the top we label the vertices 
with the three binary trees with leaves 1... 4. Each edge shows a nearest- 
neighbor interchange; for instance the exchange of leaves 3 and 4 on the 
bottom edge. At bottom left are Pauplin’s original coordinates and at 
bottom right are the coordinates, scaled by 2”“^ = 4, which we will use. 
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which we show that clade-faces can never be facets. We also take the opportunity to 
advertise future directions for the research. 

3. Edges, Clade-faces and future goals 

Known results about the BME polytope are closely related to several algorithms 
used to determine optimal phylogenetic trees. Of course with a reasonably small set of 
species or individuals one could simply create the entire (hnite) space of all the possible 
binary trees t with those species as the leaves, calculating the dot product d • c(t) for 
each one and then choosing the optimal tree as the one minimizing this product. Since 
this procedure would take far too long (it is NP-hard, as pointed out in [Day87] and 
[FJ12]) as soon as the size of the set grows beyond a certain point, we are interested in 
shortcut approaches. Two of these are the fastME algorithm and the neighbor joining 
algorithm. The former is introduced in [DG02] and the latter is developed in [SN87]. 

In [GS06] the authors show that neighbor-joining is a greedy algorithm for the BME 
method. The fastME algorithm however operates by searching the space of binary trees. 
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are found in [Hug08] and [HHYll]. The inequalities are given for any 
a, 6, c, • • • G [n]. Each can be translated to an inequality in the coordi¬ 
nates Cij simply by dividing the right hand side by 2"“^. For instance, 
when n = 4, the second inequality becomes Cab + Cbc — Cac < 1/2. Note 
that for n = 4 the three facets are described twice: our inequalities are 
redundant. 
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moving from one to another via nearest-neighbor interchange moves. These moves are 
illustrated by the edges of the triangle in Figure 2. Thus one goal for further study 
of the BME polytope is a more complete description of its edges, in order to more 
fully realize the simplex method. In [HHYll] the authors show that any subtree-prune- 
regraft move is associated to an edge in the BME polytope. The study of the facets of 
the BME polytope which we begin here can be seen as an alternate path to hopefully 
even better approximations of the simplex method. 

Since the total number of facets grows so quickly (90262 for n = 6 and beyond our 
computational patience for n = 7) and since the problem is iVP-hard, we doubt that 
a complete description of facets will be easy to find. Even if it was found the simplex 
method on all these facets may be infeasible. In the current work our stated desire to 
completely characterize the face structure of the BME polytope must be taken in this 
light: any advances are valuable despite the fact that we may be on an endless journey. 
The value of this knowledge is in its potential application, via the following conjecture: 
there is a subset of facet inequalities of the BME polytope (as found in this paper and 
its sequel) which will give us a useful relaxation of the BME polytope. 

In fact we conjecture that with just a fraction of the list of facet and face inequali¬ 
ties of the BME polytope we can describe a larger, enveloping polytope which recovers 
most or all of our original integral BME vertices, plus additional vertices with detectably 
incorrect coordinates. Should the conjecture hold, the inequalities we use for the relax¬ 
ation could be generated as needed in a branch-and-bound algorithm, halting when one 
of our powers-of-2 vertices is returned. There is possible potential for gains in speed 
over the existing algorithms, subject of course to testing. 

In the sequel to this paper we describe facets and faces based on trees that display 
a given split. A split of the set S of leaves for our BME trees is a partition of S into 
two non-empty parts. Si and 82 . A tree displays a split if Si makes up the leaves of a 
clade. {S 2 will make up the leaves of another clade.) 

In [HHYll] it is proven that any set of disjoint clades is associated with a specific 
face of the BME polytope. The clade-faces turn out to be combinatorially equivalent 
to smaller-dimensional BME polytopes. Precisely, given a collection of k clades using 
disjoint subsets of S as leaves, the face of Vn corresponding to this clade will be com¬ 
binatorially equivalent to Vn-y+k where y is the total number of leaves in the k clades. 
That is, this face will be itself a BME polytope, equivalent to one based on a set of 
n — y + k leaves. The k clades play the role of leaves, since they are fixed. Any vertex of 
this face can be described as a binary tree with n leaves such that all k disjoint clades 
are present. However, these clade-faces fail to describe any of the facets of the BME 
polytope. 

Theorem 3.1. If n > A then no clade face of Vn is a facet ofVn - 

We expect this to be true since the largest dimension clade-face would be that asso¬ 
ciated to a single cherry: the smallest clade. Here is a proof that takes a more general 
approach. 

Proof. Since a face of Vn corresponding to a disjoint set of k clades containing a total of 
y leaves is combinatorially equivalent itself to a smaller BME polytope, its dimension is 
that of the polytope Vn-y+k- Now, a facet of a BME polytope has dimension ( 2 ) —u — 1, 
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for n leaves. Thus if a facet was described by a disjoint set of k clades containing a 
total of y leaves, we could say that 

fn\ ^ fn - y + k\ . ,, 

This equation implies the quadratic equation + (2n — 3)p + 2 = 0, where p = 
k — y must be a negative integer. The roots occur at p = —n + y^/2 + 3/2 where 
q = 4n^ — 12 n + 1 . 

So for p to be an integer, we need y/g to be an odd integer, so q is the square of an 
odd (positive) integer 2 m — 1. 

Thus 4n^ — 12n + 1 = (2m — 1)^ = 4m^ — 4m + 1 for integer m > 0. 

Subtracting the I’s and dividing by 4 we get: 

n{n — 3) = m(m — 1). 

Letting n = i + 1 for z > 1 we see that n{n — 3) = i{i — 1) — 2. Thus any term in 
the sequence of integers m(m — 1 ) for m > 1 will always be equal to n(n —3)+2 = 6 „ + 2 
for some n. Since n{n — 3) increases faster than by simply adding 2 , the term am in 
question cannot be equal to any term after bn (nor any before, since both sequences are 
increasing.) 

In fact the only time that the equation can hold is for m = 1 and n = 3. □ 

This negative fact of course raises the question of how to characterize and describe the 
facets of Vn- We would eventually like a complete description, both combinatorially 
and geometrically. On the combinatorial side we would like to know which sets of 
vertices are those of a facet, and what other polytopes and constructions of polytopes 
(products, sums, pyramids, polars) those facets are equivalent to. On the geometrical 
side we would like to know how to quickly hnd the list of facet inequalities that describe 
Vn- In Figure 3 we show the data for n = 5. 

4. Facets from intersecting cherries. 

The hrst type of facet of V 5 that we found is associated to any pair of elements of 
S' = [5], along with a third element chosen after the pair. Thus there are ( 2 ) (3) = 30 
of these facets. Each of these facets has its set of vertices as follows: 

Theorem 4.1. For each pair of cherries with leaves {a,h} and {b,c}, where the pair 
a,c and the element of intersection b are three distinct elements from S' = [5], there is 
a facet of V 5 whose six vertices correspond to trees that have one of the two cherries. 

Figure 4 shows the geometry of an intersecting-cherry facet of V 5 . 

Proof. There are six total vertices since given one of the pair of cherries there are 3 
trees which have that cherry, since there are 3 elements to choose from to make the 
lone leaf. To show that these six vertices are the vertices of a face we need to hnd a 
linear inequality satished by all the vertices of V^ which becomes an equality only for 
the specihed six vertices. Then to show that the face is a facet we need to show that 
its dimension is one less than the dimension of the entire polytope P 5 . First we show 
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that the trees which have either a cherry with leaves {a, b} or with leaves {b, c} have 
associated points obeying: 



This equation holds for our trees since if {a, b} is the cherry then Xab = 4 and 
Xac = Xbc- Likewise if {6, c} is the cherry then Xbc = 4 and Xac = Xab- 

Now we need to show that for any vertex that has neither of our pair of cherries, 
then that vertex satishes: 


Xab + Xbc - Xac < 4 . 


This inequality holds since having neither cherry with leaves {a, b} nor with leaves 
{b, c} implies that Xab < 2 and Xbc < 2, while we know that Xac > 1- 

To see that our face is a 4-dimensional facet, we show that it contains a flag of 
subfaces (sequence of faces each contained in its successor) which is of length 5. We 
can proceed starting with any vertex and edge, since when n = 4 any pair of vertices 
have an edge between them. Our flag chosen for the purposes of this proof is shown in 
Figure 5, left to right with the vertex and edge hrst. We choose any vertex, but then 


x(0 = (l,4, 1,2, 1,4, 2,1,2, 2) 


40 = (1,4, 2, 1,1,2, 4, 2,1,2) 


t = 


t = 




{(4,1,1,2,1,1,2,4,2,2) 

(4.2.1.1.2.1.1.2.2.4) 

(4,1,2,1,1,2,1,2,4,2) 

(2.1.4.1.2.2.2.1.4.1) 

(2.2.2.2.1.4.1.1.4.1) 

(1.4.1.2.1.4.2.1.2.2) 

(1.2.1.4.2.4.1.2.2.1) 

(2.1.1.4.2.2.2.4.1.1) 

(1.1.2.4.4.2.1.2.1.2) 

(1.1.4.2.4.1.2.1.2.2) 

(2.2.2.2.4.1.1.1.1.4) 

(2.4.1.1.2.2.2.1.1.4) 

(1.4.2.1.1.2.4.2.1.2) 

(1,2,4,1,2,1,4,2,2,1) 



Figure 3. Two sample vertex trees of with their respective coor¬ 
dinates shown beneath, followed by all 15 vertex points calculated for 
n=5, and the /-vector for V 5 (which gives the number of faces in each 
dimension starting with 0) as found by polymake [GJOO]. 
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choose an edge which connects that vertex with another that shares with the hrst one 
of onr special cherries, say {b, c}. 

Next, the dimension 2 subface in our flag is formed by adding the third vertex that 
also contains the cherry {b,c}. These three vertices form a clade face-the clade is the 
cherry. 

The dimension 3 subface is found by adding a fourth vertex whose tree has both 
cherries {a,b} and {c,/}. Together these four make a face: all four points obey the 



Figure 4. A generic facet of P 5 with each vertex labeled by a tree which 
contains one of two intersecting cherries: {a,b} and {b,c}. The dashed 
edges outline the clade-faces (triangles) associated with those two cher¬ 


ries. 
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equation xm — Xcd = 0. The last two remaining trees in the facet are forced to obey 
Xbd - Xcd < 0. 

□ 

Consider an u x n matrix as a vector with components. Taking the convex hull 
of the n\ permutation matrices gives a polytope known as Bn, or B{n), the Birkhoff 
polytope, or assignment polytope, of order n. This polytope has dimension [n — 1)^, 
and appears in many situations, as seen in [BS96]. Here it appears again: 

Theorem 4.2. The intersecting-cherry facets of V 5 are combinatorially equivalent to 
the Birkhoff polytope of dimension f. 

Proof. This fact was hrst verihed by polymake for a specihc intersecting-cherry facet, 
where the isomorphism of vertices can be seen as preserving vertex-facet incidence. 



Figure 5. Left to right these columns show the sets of trees in faces 
that form a flag of a facet of V 5 based on two intersecting cherries. 
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Since all the intersecting-cherry facets are combinatorially equivalent by a common 
permutation of the coordinates, checking one is sufficient for checking all. To see the 
isomorphism compare the two Schlegel diagrams: from Figure 4 and from Figure 9. 


□ 


We leave open for future study the question of how one might determine the general 
isomorphism between an intersecting-cherry facet and the 4d-Birkhoff polytope. It 
would also be quite interesting to see if the relationship extends to higher dimensions. 
For now we only say the following for the general case of 

Theorem 4.3. Each pair of intersecting cherries from S = [n] corresponds to a certain 
facet of the polytope Vn- 


Proof. The trees which have either a cherry with leaves {a, b} or with leaves {b, c} have 
associated points obeying: 

^ab T ^hc ^ac 2 


This equation holds for our trees since if {a, b} is the cherry then Xab = 2"'“^ 
Xac = Xbc- Likewise if {b, c} is the cherry then Xbc = 2"'“^ and Xac = Xab- 

For any vertex that has neither of our pair of cherries, then that vertex satisfies: 

XqJj T ^bc ^ac ^ 2 


and 


This inequality holds since having neither cherry with leaves {a, 6} nor with leaves 
{6, c} implies that Xab < 2”“'^ and Xbc < 2"'“^, while we know that Xac > 1- 

So far we have shown that the collection of trees with either cherry {a, 6} or {6, c} 
forms a face Pabc with inequality Xab + Xbc — Xac < 2"'“^. Next we show that the face 
Pabc is in fact a facet, of dimension ( 2 ) — n — 1. The strategy is to show existence of a 
flag of Vn beginning with Pabc and ending with a single vertex, which has total length 
( 2 ) ~ start with a chain of sub-faces of Pabc which has length n — 3, including 

Pabc itself. Then we show a hnal sub-face which has dimension (" 2 ^) — (n — 1). Thus 
the entire flag is of length ("'2 — (n — 1) -|- 1 -|- n — 3 = ( 2 ) ~ We have illustrated 

this flag in Figure 6. 

After Pabc the largest face in our flag is the one whose vertices are described as each 
corresponding to a tree that has either the cherry {a, b} or has both the cherry {b, c} 
and the cherry {a, /}. We call this face Pabcj- The vertices of Pabcj obey the equality: 

Xbc T Xbf Xac b* 

The trees with cherry {a, b} have equal distances from those two leaves, while the 
remaining trees have coordinates Xbc = Xaf and Xbf = Xac- Trees that have cherry {b, c} 
but not cherry {a, /} obey the inequality: 


Xbc T Xbf Xac Xaf ^ 0* 

This is true since the tree cannot have the cherry {a, c} either, so Xac + Xaf < 2”'“^. 

Next, given an ordering {yi, y 2 ,..., yn- 4 ) of the leaves not including a, b, c or /, there is 
a sequence of n — 5 sub-faces called Plbcji Pabcji ■ ■ ■ i ^ahcj- The face Pabcj vertices 
described as each corresponding to a tree that has either the cherry {a, b} or is a 
caterpillar that has both the cherry {a,/}, at one end, and the cherry {b,c} at the 
other. Between the cherries in the caterpillar are the leaves yi,... ,yk in that order 
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beginning closest to the cherry {a,/}. The remaining leaves yj for j > k fill in the 
caterpillar in any order; note that there are n — 5 such collections since the last leaf is 
determined when we reach See Figure 6. 

The vertices of Pabcj obey the linear equality: 

(2^-3 - l)xay, - (2"-3 - l)xby, + (2^-3-^ - 2 '^)xab = (2"-="-" - 2 ^)2^-\ 
or, more conveniently, 

(2^-3 - l)ix.y, - Xby,) = {2^-^ - XabW-^-'^ - 2% 

The equality is clear for trees that have the cherry {a,b}, since it becomes 0 = 0. For 
caterpillar trees of the face we have Xab = 1 and Xay^. — Xby^. = — 2^. 

We need to show that for trees in Pabcj that are not in Pabcj we have the inequality: 

(2"-3 - l){Xay, - Xby,) < (2"-3 - XabW-^-^ - 2^). 

There are two cases: 

Case 1) If the tree is a caterpillar, with leaf yi more than two nodes from leaf a, then 
the inequality follows from Xah = 1 and Xay^ — Xby^ < 2"'“^ — 2. 

Case 2) If the tree is not a caterpillar, we show the inequality by induction on the 
number n of leaves. We check the base case n = 6, where the inequality becomes 
0< (2^ - 2)(2^ - 2). 

Assuming the inequality for m < n, then in the case for n leaves we choose a cherry 
and replace it with a single leaf. There are two subcases of Case 2. 

Subcase (i): If leaf yi was in the chosen cherry then we call the replacement leaf yi 
instead, and by induction we have the inequality: 

(2-1-3 _ l)(xay, - Xby,) < (2"-i-3 - Xabl2){2^-^-^ - 2). 

where the values for the coordinates mentioned are the same as in the n-leaved tree 
before replacement. 

Multiplying by 2, we get: 

(2-3 _ 2){xay, - Xby,) < 2(2-i- 3 - x,b/2)(2-5 - 2) 

= (2-3-x,b)(2-5-2). 

Expanding on the left and then adding to both sides gives: 

(2n-3 _ l)(^Xay^ - Xby^) < (2"-3 - Xab){2'^~^ “ 2) + Xay^ “ Xby^ 

Expanding on the right via 2^~^ = 2'^~‘^ — 2"'-®, we get: 

(2-3 - l){Xay, - Xby,) < (2-3 - Xab){2^-^ - 2) - 2-5(2-3 - X^b) + X^y, - Xby, 

= (2-3 - Xab){2^-^ - 2) - (2-^)2 + 2^-^Xabl2) + Xay, - Xby, 

Using the facts that, for our non-caterpillar tree, we know Xay^ — Xby^ < 2”-"i — 2 and 
Xab < 2"“'^, we get: 

(2-3 - l){xay, - Xby,) < (2-3 - Xab)(2-4 - 2) - (|(2-4)2 - 2-^ + 2). 

The last term is a polynomial in 2"-4 whose minimum is 3/2 when n = 4. Thus it 
can be dropped to achieve the desired inequality. 

Subcase (ii) If leaf yi is not in the chosen cherry then by induction we have the 
inequality: 

(2-1-3 _ l){xayj2 - Xb^j2) < (2"-4-3 - x,b/2)(2"-4-4 - 2). 
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where the values for the coordinates mentioned are the same as in the n-leaved tree 
before replacement. 

Multiplying by 4, we get: 

(2"-3 - 2){Xay, - Xby,) < (2-3 - Xab){2^-^ - 4) 

Expanding on the left and then adding to both sides gives: 

(2-3 _ < (2-3 _ - 2) - 2(2-3 _ 

Using the facts that, for our non-caterpillar tree, we know Xayi — Xbyi < 2”“'^ — 2 and 
Xab < we get: 

(2-3 - l){xay, - Xby,) < (2-3 - Xafe)(2-4 - 2) - (2-2 - 2-3 - 2-4 + 2) 

= (2—3 - Xab)(2—4 - 2) - (4(2—4) _ 2(2—4) _ 2^-4 ^ 2) 

= (2—3 - Xab) (2—4 - 2) - (2—4 + 2). 

Since the last term is greater than 2, it can be dropped to achieve the desired in¬ 
equality. 

Next we need to check that for trees in Pabcj that are not in Pabc/ have the 
inequality: 


(2"-=> - < (2“-3 - i„,)(2"-='-'= - 2‘). 

This inequality is straightforward, since Xab = 1 and since the leaf yk is forced to be 
closer to leaf b and further from leaf a, if it is not in the position. 

□ 



Figure 6. Each set is the collection of trees (vertices of Vn) that have 
any placement of the remaining labels from S into the blank leaves shown. 
The containment of sets also shows the flag of our facet Pabc- The con¬ 
tainment is Pabc 3 Pabcj D PLj ^ Pabcj ^ ^ P'bcJ ^ Pab- 
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The facet inequalities we describe in the above proof are equivalent to the triangular 
inequalities of Proposition 3 in [CLPSG12]. This connection does raise the question of 
whether other inequalities in that paper can lead to facets of the BME polytope. 

Note that the dimension of a general face corresponding to an intersecting pair of 
cherries is greater than the dimension of the clade-face for the clade that is one of those 
cherries. We conjecture based on initial experiments (verihed by polymake for n= 6 ) 
that in fact the dimension is [^'j—n — 1, implying that these intersecting-cherry faces are 
indeed facets. We also leave for the future the investigation of other sorts of intersecting 
sets of clades: we conjecture that two or three or more clades, of various sizes and tree 
geometry, intersecting in various ways, will lead to further faces and facets of 

5. Facets from free cyclic orderings. 

A free circular permutation (or free cyclic ordering, or necklace) of the elements of 
S is only distinguished by which elements are adjacent. It is an arrangement of the 
elements of S around a circle, which may be rotated or flipped. We are interested in 
the binary trees on S which are coplanar with a certain free cyclic ordering on S. That 
is, having drawn one of the two planar versions of the cyclic ordering, we can then draw 
the tree in the same plane, as in Figure 7. The number of trees coplanar with the free 
cyclic ordering is found by a simple counting argument for n = 5 : there are 5 choices 
for the hrst cherry and the two for the second cherry; but then we divide by two since 
the order we choose the cherries in is irrelevant, giving us hve total trees. 

Theorem 5.1. For each free cyclic ordering M on S with IS"! = 5 there is a facet of 
V5 that is equivalent to a f-simplex, whose five vertices correspond to trees that are 
coplanar with the free cyclic ordering. 

Proof. The trees which are coplanar with J\f = (a, b, c, d, f) satisfy Xab + Xbc + Xcd + 
Xdf + Xfa = 13. That is because two cherries are represented by those components, and 
the remaining components are assigned values 2 , 2 and 1 respectively. All other trees 
in V5, not coplanar with Af, obey Xab + Xbc + Xcd + Xdf + Xfa < 13. That is because at 
most one cherry can be among those components, and the rest of the components then 
can at most be assigned the value 2. Thus the components add up to at most 12. 

Thus our hve trees constitute the vertices of a face of V5. We show that this is a facet, 
with dimension equal to 4, by establishing within it a hag of length 5. We can proceed 
starting with any vertex and edge, since any pair of vertices have an edge between them. 
Our hag chosen for the purposes of this proof is shown in Figure 8 , left to right with 
the vertex and edge hrst. The set of three vertices makes a triangular face of the facet 
since they obey the equality Xbf = 1 while the other two vertices have Xbf > 1. The set 
of four vertices make a 3-simp lex since they all obey the equality Xbf + Xbd + Xac = 4:, 
while the hnal vertex has Xbf + Xbd + Xac > 4. 

□ 

The number of free cyclic orderings on 5 objects is 4!/2 = 12. Via polymake we see 
that there are exactly 12 facets of P 5 that have 5 vertices. Thus we have accounted for 
all of these facets with free cyclic orderings. 
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6 . Facets from Caterpillars. 

The third type of facet for corresponds to a choice of two elements of S. These are 
placed as leaves on a tree that are as far apart as possible: in this case a distance of 3 
internal nodes on a binary caterpillar. Thns there are six ways to place the remaining 
three elements of S as the other three leaves, and the caterpillar facet has 6 vertices. 

Theorem 6.1. Each pair of elements a, b from S with IS"! = 5 determines a facet ofV^ 
whose vertices are trees that have a and b as leaves of distinct cherries. 

Proof. The result follows from the general Theorem 6.3 which establishes the fact for 
all dimensions. Specihcally, each tree that has the elements a and b separated by 3 
internal nodes has corresponding vector that obeys Xah = 1- All other trees, which do 
not have this property, obey Xab > 1- The flag of length 5 which establishes that the face 
in question is indeed a facet is described inductively in the proof of Theorem 6.3. □ 

The number of these facets in is ( 2 ) = 10. Note that now we have described 
30 + 12 + 10 = 52 facets of Ps, the total number predicted by polymake. The three 
classes of facets do not intersect for n = 5: that is, a caterpillar facet cannot be 



Figure 7. A generic facet of P 5 based on a free cyclic ordering (a, b, c, d, /). 






BME FACETS 


15 


an intersecting cherry facet (nor vice-versa), since the collection of caterpillar trees is 
determined by choosing two leaves to be as far apart as possible, while the intersecting 
cherry trees allow any two leaves to be closer than the maximnm distance. 

Theorem 6.2. The caterpillar facets ofV^ are combinatorially equivalent to the Birkhoff 
polytope of dimension f. 

Proof. This fact was hrst verihed by polymake, where the isomorphism of vertices may 
be seen as preserving vertex-facet incidence. Since all the caterpillar facets are com¬ 
binatorially equivalent by a common permutation of the coordinates, checking one is 
sufficient for checking all. We illustrate the isomorphism by showing the two Schlegel 
diagrams in Figure 9. □ 

The generalization of this caterpillar facet for any size set S has vertices any collection 
of trees with leaves S that are all binary caterpillars, with a pair of chosen species as 



Figure 8 . Left to right these columns show the sets of trees in faces 
that form a flag of a facet of P 5 based on a free cyclic ordering (a, b, c, d, /). 
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the two which must reside as far apart as possible-as leaves of the only two distinct 
cherries. These faces are indeed facets of Vn, each with (n — 2)! vertices. In general 




Figure 9. On the top is the Birkhoff polytope B{3) with vertices labeled 
by the permutation matrices. On the bottom is a facet of V 5 with each 
vertex labeled by the tree corresponding to the permutation matrix in 
the corresponding position. 
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they are not equivalent to the Birkhoff polytope B{n — 2), since for n > 5 the facets 
of Vn have a dimension ( 2 ) ~ ~ 1 which is greater than (n — 3)^ (the dimension of 

B{n — 2).) However an interesting projection is suggested by the 4-dimensional case in 
Figure 9, where a permutation matrix corresponding to permutation a is mapped to the 
tree with leaves 3,4, and 5 in the order (t(3), (t(4), (t( 5). We leave as an open question, 
for instance, whether this map in the general case gives rise to a cellular projection to 
the Birkhoff polytope from facets of the balanced minimal evolution polytope. 

Theorem 6.3. Consider the set of binary caterpillar trees on n leaves S with a given 
pair from the set S as maximally separated leaves. The vertices of Vn calculated from 
these caterpillar trees are the vertices of a facet ofVn- 

Note that the number of these facets in Vn is ( 2 ) for n > 4. For n = 4 there are 
half that many, the three edges of the triangle in Figure 2, since having chosen two 
elements of S to be placed in the distinct cherries we automatically determine the other 
two elements which will also be placed in distinct cherries: in the notation of the proof 
that follows we have for instance that Pj ^2 = -^ 34 - 

Proof, of Theorem 6.3 For the purposes of this proof we choose the set S = [n] and 
without loss of generality we let the two hxed leaves with maximal distance n—2 between 
them be the leaves labeled 1 and 2. We’ll continue by choosing leaves in counting order: 
this will be without loss of generality since any other selection of leaves is covered by 
choosing an appropriate ordering. 

Thus the caterpillar trees with hxed leaves 1 and 2 obey X 12 = 1, and all other points 
in Vn obey 0:12 > 1. We call this face Next we use induction on u, the number of 
leaves, to show that the face is in fact a facet, of dimension (”) — u — 1. The strategy 
is to show existence of a hag of Vn beginning with P{^ and ending with a single vertex, 
which has total length ( 2 ) ~ We start with a chain of sub-faces of P{^ which has 
length n — 2, including P{^ itself. Then we show a hnal sub-face which has the same 
dimension as Pfff^, thus inductively of dimension ("' 2 ^) — (u — 1) — 1. Thus the entire 
hag is of length — (n — l)+n — 2 = ( 2 ) — n. 

The base case of our induction is n = 4. See Figure 2 where each edge of the triangle 
is a facet of this type. Specihcally, the edge Pj ^2 is at the bottom of the triangle. 

After Pf 2 the largest face in our hag is the one whose vertices are described as vertices 
of Pf 2 whose caterpillar tree is one of two types, as seen in Figure 10. The tree has a 
third hxed leaf, say the leaf labeled 3, either in the same cherry as the leaf 1; or as the 
leaf nearest that cherry but not in it. We call this face Pi’^ 3 , and note that it contains 
2{{n — 3)!) vertices. To see that it is indeed a face, we show that its vertices obey the 
equation 


Xl 3 


=2 


n—3 


, 2=4 


...and that all other vertices in Pf^ obey the inequality: 


2:13 


2 "“" > 2 


n—3 


, 2=4 
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First, the vertices of whose caterpillar tree has the leaf labeled 3 in the same 
cherry as the leaf 1 : for these the difference Xu — Xs, = 0 for each i, while Xi^ = 2 "'“^. 
For the vertices that have leaf 3 as the leaf nearest the cherry containing leaf 1, but not 
in it: Xi^ = 2 ”“'^ and the sum of differences telescopes and simplihes to equal = 1 . 
The equality holds since 2(2”“^) = 2"“^. 

Any other leaf of not in P{^ 3 has leaf 3 even further from the cherry containing 
leaf 1. Now the sum of differences will telescope and simplify to become 1 + 2 + ■ • ■ + 2-^ 
where j is the number of leaves further (than 1) from the cherry that leaf 3 is found. 
Since the latter sum is larger than 2 , the left side of our inequality is greater than 
Next we describe a sequence of n — 4 nested faces (of steadily smaller dimension) 
labeled Pi^ 34 , Pi 2 , 345 , • • •, -^ 12 , 345 ,,,*; for A: = 4 ... n — 1. The vertices of Pi ^_34 (the hrst in 
this series, with largest dimension) are vertices of P{^ 3 which either have leaf 3 in the 
cherry with leaf 1, or have leaf 4 in the cherry with leaf 1. After that, for /c > 4 the 
vertices of Pj^ 34 ^ are vertices of P [^34 with either leaf 3 in the cherry with leaf 
1 or leaf 4 in the cherry with leaf 1 and leaves 5 ... /c in that order immediately on the 
other side of leaf 3. See Figure 10. 

First we show that the vertices of Pi’^ 345 ,,,fc obey the equality: 

\ i=4 

Consider the vertices of P/^ 345 ,,,^ whose caterpillar tree has the leaf labeled 3 in the same 
cherry as the leaf 1 : for these the difference Xu — 0 : 3 , = 0 for each i, while 0:13 = 2 "'“^. 
For the vertices that have leaf 3 as the leaf nearest the cherry containing leaf 1, but 
not in it: X 13 = 2 "'“^ and the sum of differences telescopes and simplihes to equal 2 ”“^. 
The equality holds since 2(2"'“'^) = 2"“^. 

To check for the needed inequalities we begin with k = A. We show that the vertices 
of P 12 3 which are not in P(^ 34 obey the inequality: 

2"-'^xi3 + 2 "-" (5^(xn-a;30 
\i=4 

For k = A, and since these trees have leaf 3 as the leaf nearest the cherry containing 
leaf 1, but not in it, and leaf 4 also not in that cherry, this inequality becomes: 

2”-42"-4 + 2 ”-^ (xm - 0 : 34 ) < 

This inequality holds since 0:14 — X 34 < 0, and so 2"'“'^ + (xi 4 — xu) < 2”“'^ < 2"'“^, 
which leads to the desired inequality. 

Now, for k>A,we need to show that the vertices of P{^ 3 which are not in P(^ 3 ^ 
obey the inequality: 

k 

^(Xii - X3i) 

i=A 





2^-^xi3 + 2 ”“^ 
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Since these trees have leaf 3 as the leaf nearest the cherry containing leaves 1 and 4, 
this inequality becomes: 






, i=4 


Since leaf /c is at a position farther from leaf 1 than if the tree was in P 123 ki 
the sum of differences telescopes and simplihes to 2 "“^+^ — 2 ^ where x < n 
X — {n — k) < 0 and it is clear that 1 + 2 — > 2. Therefore: 

2^n—k2^n—4 _j_ 2^n—42^n—k^2^ _ 2^x—{n—k)'^ ^ Qn—^Qn—k 


then 
k. Thus 


which is the simplihed inequality we needed to show. 

Finally we reach the subface called P 123 , which consists of the vertices of Pi 2 3 i^,,,n-i 
which have leaf 3 in the same cherry as leaf 1. That of course means they are only 
of the hrst type. They constitute a subface, since they obey the additional equality 
Xi 3 = 2"'“^, while the other vertex of T’" 2 , 345 ...n-i obeys X 13 = 2”“^ < 2”“^. We check 
that this face projects to PiV^; which as discussed above will give us the correct 
number of remaining faces of our flag, by induction. The linear projection is described 
by its action on the ( 2 ) coordinates, yielding ("' 2 ^) new ones; thus it is given by an 
("^2 ^ ( 2 ) which has rows and columns labeled by the respective coordinates 

in lexicographic order. 

First each coordinate involving leaf 3 (so Xi^ or x^i) is discarded. That means the 
columns of A corresponding to these coordinates are made up of zeroes. Second, the 
coordinates that involve leaf 1 (so Xn or xu) are multiplied by 1 , but sent to the 
new coordinates with the same label if i = 2 , or to the coordinates or Xi(j_i) 

respectively if i > 4. Thus the columns of A corresponding to these coordinates have a 
single entry of 1 in the row corresponding to the new coordinate. Finally the coordinates 
Xij involving neither leaf 1 nor leaf 3 are multiplied by 1/2 and sent to the new coordinate 
Xiiji where i' = i — 1 for i > 2 and i' = i for i = 2] and likewise for Thus the columns of 
A corresponding to these coordinates have a single entry of 1/2 in the row corresponding 
to the new coordinate. For instance here is the matrix A for u = 5; it takes the two 
sample vectors listed in Figure 3 (the two vertices of the lowest edge in Figure 9) to the 
two vectors labeling the lower edge of the triangle in Figure 2. 


'10000 0 0 00 0 ' 

00100 0 0 00 0 

00010 0 0 00 0 

00000 1/2 0 00 0 

00000 0 1/2 00 0 

.00000 0 0 00 1/2 . 

We claim that the image of P {^3 under A is the polytope Pi 2 ^- This follows from the 
fact that the projection A induces a 1-1 and onto mapping between the vertices of the 
two polytopes. Since there are (n —3)! vertices of each polytope, the surjective property 
of the mapping implies that it is a bijection. We can easily describe the preimage of 
a vertex in Pi<f^ : take its caterpillar tree, attach a new branch as close to leaf 1 as 
possible, and give it leaf 3. Then add 1 to increment each of the other leaves except for 
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leaf 2. The resulting new tree has the coordinates required. The coordinates involving 
leaf 3 will be discarded, so their value can be ignored. The leaves in our new tree are 
all now one node further away from leaf 1, but using the new total number of leaves n 
this difference is canceled. The coordinates involving neither leaf 1 nor leaf 3 are the 
same except for the factor of 2. 

□ 
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Figure 10. Each set is the collection of trees (vertices of Vn) that have 
any placement of the remaining labels from S = [n] into the blank leaves 
shown. The containment of sets also shows the flag of our facet The 
containment is D 3 D ^ 12,34 T* -^ 12,345 Y • • • D Pi 2 , 345 ...n-i T* -Pl 23 - 
Note that the next to last set also has leaf n fixed since there is only one 
place for it, and the last set (bottom right) is the set of vertices of Vn 
which label a face that projects to the facet PiV^- 
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